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Abstract 



In speculative markets, risk-free profit opportunities are eliminated by traders 
exploiting them. Markets are therefore often described as "informationally 
efficient", rapidly removing predictable price changes, and leaving only residual 
unpredictable fluctuations. This classical view of markets absorbing information 
and otherwise operating close to an equilibrium is challenged by extreme price 
fluctuations, in particular since they occur far more frequently than can be 
accounted for by external news. Here we show that speculative markets which 
absorb mainly self-generated information can exhibit both: evolution towards 
efficient equilibrium states as well as their subsequent destabilization. This 
peculiar dynamics, a generic instability arising from an adaptive control which 
annihilates predictable information, is realized in a minimal agent-based market 
model where the impacts of agents' strategies adapt according to their trading 
success. This adaptation implements a learning rule for the market as a whole 
minimizing predictable price changes. The model reproduces stylized statistical 
properties of price changes in quantitative detail, including heavy tailed log 
return distributions and volatility clusters. Our results demonstrate that the 
perpetual occurrence of market instabilities can be a direct consequence of the 
very mechanisms that lead to market efficiency. 



Social systems self-organize. In consequence, collective behaviors can emerge that appear to 
pursue a common goal which is actually not present in the behavior of the single agents (111). 
The view that markets in fact operate in distinguished equilibrium states became influential 
in economics (J2|). Here, a fundamental hypothesis is that markets operate informationally 
optimal. That is, prices are assumed to "fully reflect available information" (J3I), or at 
least come close to this limit (Q), such that risk- free (arbitrage) profits cannot be made 
by (re-)using said information. If true, one of the implications of this "Efficient Market 
Hypothesis" (EMH) is that resulting prices fluctuate randomly (|5|). 

Empirical findings in favor of the Efficient Market Hypothesis include the general absence 
of exploitable autocorrelations among price changes in financial markets ((]). The magnitudes 
of price changes ("volatilities"), however, are correlated for long periods of time. That is, 
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large price changes are typically followed by large ones and small changes by small ones, 
indicating that markets are complex dynamical systems involving long memories 

a a. 

Furthermore, logarithmic price changes (log returns) exhibit large fluctuations even in the 
absence of external news (]9|, lldl ). contradicting naive expectations for systems in stable 
equilibria. More precisely, log return distributions exhibit heavy tails where events that are 
many times bigger than the standard deviation occur at a much higher frequency than what 
would be expected if they were Gaussian distributed (0, [ll|) (Fig. CD (c)). These so-called 
stylized facts of price time series hint at underlying dynamics that are substantially self- 
referential (112l ). They were associated either with herding effects (|13h . "bubbles", or with 
the interactions of heterogeneous traders with limited rationality (1141 ) in a market exactly 
at a critical point (1151). 

Here we investigate whether the apparent antinomy of stabilizing information efficient 
control and dynamics resembling systems operating close to criticality can be resolved by a 
recent non-economic theory (I16l ): It was shown that adaptive control of a dynamical system 
can generically lead to an instability where the susceptibility to noise dramatically increases 
close to the point of perfect balance. This principle applies to markets if two requirements 
are fulfilled: First, markets have to absorb information about predictable price changes. 
Ideally, this property should hold independently of the rationality of the individual traders, 
which cannot be guaranteed. Second, a self-referential market needs to become susceptible 
to residual noise once all locally relevant information has been exploited. 

The first property is a rather common view in economics. If a profit opportunity is present, 
investors will increasingly exploit it until their price impacts cancel said opportunity. For 
example, if a certain stock is priced too low, traders will increase buy orders and thereby raise 
its price. The second property is also intuitive. As traders try to detect trends or patterns in 
the price dynamics, they effectively predict how the market reacts to available information. 
However, once the agents' actions have led to a balanced equilibrium, it becomes increasingly 
difficult to distinguish predictable price fluctuations from random noise. If traders then act 
upon the random fluctuations as if they would hold meaningful new information, their 
actions will not be balanced anymore. That is, it may be impossible to predict whether a 
group of traders overreact to the supposed new information and to attenuate the resulting 
price jump by exploiting it. Therefore, atypically large price movements may become much 
more likely than expected for a Gaussian distribution. 
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The Model 



As a concrete example of the fundamental dynamical instability arising from information 
absorption as it may be realized in financial markets, we introduce a minimal agent-based 
trading model. Each agent is representative of one trading strategy and possesses two types 
of assets which are called money and stocks in the following. For simplicity, we consider a 
coarse-grained price time series where one step could be considered as e.g. a day. In each 
time step, every agent either buys or sells an amount of stocks. The fraction of an agent's 
money that is used to buy stocks or vice versa is denoted by the use parameter 7. At each 
time t, all trades are performed at the same price p(t) which is determined from the ratio of 
total demand and supply. Hence, trading per se conserves the total amounts of each asset, 
money and stocks. 

Agents base their decisions on public information states. In each time step t one of D 
possible states, which is denoted by an index /i(t) G {1, . . . , D}, is conveyed to the agents. 
Each agent's decision to either buy or sell is fixed in time for each fj, and assigned at random. 
We consider two different methods for the generation of these information states: 

For endogenous information, agents possess a memory of the most recent K signs of the 
log returns which indicate whether the prices pit — K), . . . ,p(t) decreased or increased with 
respect to their predecessor. This information can take one out of D = 2 K possible states. 

For exogenous information, are drawn randomly and independently with probability 
Pext(^)- Unless stated otherwise all /i have equal probabilities -P C xt(At) = l/D. We also 
investigated mixed information and obtained results similar to the endogenous case (see 
supplement). 

We focus on markets that are dominated by speculators who can win or loose assets only 
by betting on price changes within the market. To investigate the effect of a small number of 
traders that convey new assets to the market or draw out their profits, we divide the agents 
into N s speculators and N p producers. Producers' resources by definition stay constant. 
Thus, speculators only redistribute their wealth and are competitive whereas producers 
provide a predictable supply of liquidity and stocks. All agents are initially provided with 
equal amounts of assets. 



4 




FIG. 1: Analysis of log returns r(t + 1) = log p(t + 1) — logp(i), where p(t) is the price at time 
step t. (a): Time series of the trading model with uniformly distributed exogenous information. 
Parameters: N s = 2 10 , N p = 0, D = 2 9 , 7 = 0.8. (b): Time series for the same model, but 
with endogenous information, (c): Complementary cumulative distribution function of log return 
magnitudes obtained by rank ordering. Dotted black: The same simulation as in (a). Solid 
black: the same simulation as in (b). Dashed red: Daily returns for the Dow Jones Industrial 
Average (DJIA) since 1900. Short grey line: Hill estimator for the scaling exponent £ of the 
black curve, where large returns show power-law scaling P(\r\ > x) oc x~^. The cutoff optimizes 



the Kolmogorov-Smirnov statistic (jlq ). For a fair comparison, 3 • 10 time steps after transients 
were analyzed for the simulations to match the length of the Dow Jones time series. Each time 
series was normalized by dividing by its standard deviation, (d): Autocorrelations of the log- 
return magnitudes \r\. Line styles are identical to (c). Positive autocorrelations for the model with 
endogenous information, and the DIJA persist for more than 2 years. 

Stylized Facts 

Log-return time series for the model with exogenous information are shown in Fig. [1] (a). 
A strong reduction of initial fluctuations is observed leaving only a narrow band of Gaussian 
distributed returns after the transient. Fig. [Q (b) shows the endogenous case. Here, in 



contrast, initial return magnitudes are reduced only in the mean. The magnitudes of the few 
most extreme returns, however, are less reduced. The remaining fluctuations are analyzed in 
Fig-ffl(c), where cumulative distributions of return magnitudes are shown for both cases and 
compared to the Dow Jones Industrial Average (DJIA). The latter serves as an example for 
a typical price time series. For the endogenous case, the distribution tail is well described by 
a power-law and in good agreement with the DJIA. Return fluctuations in the endogenous 
case also tend to form clusters in time. This effect is quantified by long-range temporal 
correlations of return magnitudes shown in Fig. [1] (d) and is also consistent with the DJIA. 

Dynamics with Exogenous Information 

To understand the model dynamics, we first consider the exogenous case which is fully 
analytically tractable. The rules of asset redistribution by trading are equivalent to a learning 
rule related to gradient descent, allowing for the market as a whole to minimize predictable 
price changes. The reason for this stabilizing control is that trading success increases the 
impact of agents whose actions contribute to a reduction of price fluctuations. A phase 
transition with respect to the critical parameter a = D/N s is identified at a = 1/2, the 
point where random binary vectors (the agents) with positive weights (the assets) form a 
complete basis in the D-dimensional strategy space in the limit N s — > oo. Beyond this point, 
a speculative market without producers evolves the distribution of assets onto a manifold 
where the price is invariant to trading. That is, agents still trade and exchange assets, but 
the price remains constant. Markets that include producers still exhibit finite returns also 
for a < 1/2. Otherwise, for N p <C N s , return distributions only depend weakly on N p . See 
Fig. El and the supplementary material for more details. 

Information Annihilation Instability 

When the fj,(t) are endogenously generated, the same mechanism of information absorp- 
tion present in the exogenous case ensures that the system relaxes towards local price equi- 
libria and returns vanish, but only transiently. Fig. [2] (a) shows the price time series for a 
simulation with a very small use 7 which formally is a learning rate. At any point in time, 
the system moves towards a certain price which characterizes a local equilibrium. As the 
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FIG. 2: (a): Price time series with a very slow rate of resource redistribution (use) 7 = 0.01. Other 
parameters: N = 2 10 , N p = 2 4 , D = 2 9 . (b): A zoom in on the time series shown in (a). 



system approaches this equilibrium, price fluctuations are reduced. These fluctuations gen- 
erally consist of complex oscillations like the one shown in Fig. [2](b). The equilibria become 
unstable once all predictable information is exploited by the speculators. Then, even little 
overshooting of the adaptation dynamics or noise can lead to price changes corresponding 
to information states that were not predicted by patterns in the immediate past. Because 
the market is not well-adapted to these new states, the possibility of large price changes 
increases dramatically. Compared to (1161). we here discovered an instability due to informa- 
tion annihilation in a mathematically different way, which demonstrates that this concept 
is even more general. 

For larger 7, this behavior is not as obvious: time series appear random and distinct 
oscillations are rarely visually recognizable (Fig. Q](b)). Still, phase diagrams from extensive 
simulations demonstrate that return distributions are largely unaffected by these effects over 
wide ranges of 7: Fig. |3] (a) shows how much initial mean log return magnitudes are reduced 
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(a) Return Attenuation 




Use 7 

FIG. 3: (a): reduction of average return magnitudes during transients and (b): kurtosis of log- 
returns of the model with endogenous information for different a vs 7. A kurtosis of 3 corresponds 
to a normal distribution. Here, the system size is set constant at N s = 2 10 speculators and N p = 2 4 
producers. For each time series, 2 • 10 7 time steps were simulated. Reductions are measured as 
the ratio between the mean log-return magnitudes during the first 10 and the last 10 7 timesteps. 
The kurtoses were calculated for the last 10 7 timesteps. Simulations were performed on a grid, all 
axis ticks correspond to node positions. For each node on the grid, 50 time series were analyzed 
and results were averaged. Values in between nodes were obtained by linear interpolation after 
logarithmizing the values at each node. Color mapping was performed on the log-values, contour 
line labels are the actual values for variances and kurtoses. For a < 1/2, the kurtoses reach extreme 
values that can not be reliably estimated from finite time series. Therefore, the color scale in (b) 
was set to not extend to values above 100. 

for different a and 7; Fig. |3] (b) shows the impacts that infrequent extreme returns have 
on the remaining variances, which are measured by kurtoses after transients. The stronger 
the reduction of return magnitudes (Fig. |3] (a)), the heavier tailed the return distributions 
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are (Fig. [3] (b)). Therefore, a clear link between information annihilation and extreme 
returns is established in our model for the whole parameter space. This relation between 
return reduction and kurtosis is not found for exogenous information drawn from a uniform 
distribution (Fig. Q](b), and supplement). 



Surprising Information Causes Large Returns 

We find that large returns are caused by information states that have not occurred for a 
long time. Intuitively, such states are more surprising and therefore carry more information 
than the ones visited more recently - a concept that is closely related to Shannon informa- 
tion. The more surprising an information state is, the higher the corresponding log return. 
The correlation between log return magnitudes and the times r that have passed since the 
respective information states occurred last is shown in Fig. H] (solid line) for endogenous 
information. Here, the absorption of local information in combination with rare jumps leads 
to a strongly inhomogeneous distribution of visiting frequencies over the information set: 
the probability distribution P(r) is power-law tailed with an exponent of approximately 2.5 
(see supplement). This suggests that the self-reflexive dynamics for endogenous informa- 
tion generates a characteristic distribution of information states that ultimately underlies 
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FIG. 4: Correlation of return magnitudes \r\ with the time r since the corresponding information 
states occurred last. Model parameters: D = 2 10 , N s = 2 , N p = 0, 7 = 0.5. Simulation length: 
T = 2 • 10 7 ; the first T/2 timesteps were discarded for the analysis. Black line: Endogenous 
information. Dashed line: Exogenous information with P e xo(^) oc exp(— 0.02^), leading to P(t) oc 
t -2 . Both lines are averages over 10 simulations. 
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extreme price fluctuations. We tested this hypothesis by using inhomogeneously distributed 
exogenous information states leading to similarly distributed r. Then, as in the endogenous 
case, return magnitudes are strongly correlated with r (Fig. HJ dashed line). 



Discussion 

Our model demonstrates that even a simple yet plausible mechanism of order size adap- 
tation suffices to coordinate the impacts of diverse strategies such that information becomes 
completely absorbed in the price. The proof that the trading rules in our model correspond 
to an efficient gradient based learning rule that minimizes predictable return magnitudes 
provides a rigorous link of a fundamental market mechanism to adaptive control. We tested 
variations of this mechanism including adaptations of individual agent use parameters, and 



order-book based pricing (1171 ): information absorption proved to be a robust property. We 
therefore consider it highly plausible to prevail in real markets despite the complexities of 
real pricing mechanisms and order size adaptations, as long as the latter correlate with 
trading success. 

"Rationality", in the sense of an efficient adjustment of prices to new information, in our 
model emerges from the collective behavior of many traders. We find that a minority rule 
with respect to the returns is a dominating factor in its dynamics. That is, traders whose 
actions counteract those of the majority profit most. 



In the well known minority game (1151 ) (MG), however, adaptation is based on the choices 
of the participating agents, and a single-step payoff with respect to the excess demand. This 
differs from our model where the market adapts traders' impacts based on price changes, 
and also over different time scales. Most importantly, the mechanism for herding previously 
discussed for MGs is a breakdown of the efficient coordination of agents in overcomplete 
markets where they become too correlated. This leads to an increase of average fluctuations 
and does not depend on how the information conveyed to the agents is generated. 

In contradistinction, the instability due to adaptive control is independent of microscopic 



interactions and in fact was first realized in a one-dimensional random map (I18l). The 
heavy tailed distributions here are a direct consequence of the elimination of local trends or 
patterns which yields a net decrease in average fluctuation magnitudes. These fluctuations 
are therefore a sign of high efficiency and do not signal its breakdown: they reflect surprising 
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information. Furthermore, a closed loop involving endogenous information is essential for 
creating this instability leading to extreme fluctuations that are not caused by external 
events. Therefore, the results presented in this paper are complementary to the published 
findings based on MGs in several respects. It is, however, possible to formulate a minority 
game corresponding to our model, but that would go beyond the scope of this paper. 

The fact that efficient information annihilation does not result in a unique and stable 
equilibrium, but instead can lead to local states that perpetually become unstable, provides 
a new and comprehensive solution to a long standing antinomy in economics where both no- 
tions have been considered mutually exclusive (114J). Furthermore, our adaptive agent-based 
model not only demonstrates the theoretical principle of information annihilation instability, 
but can reproduce important "stylized facts" of financial markets also quantitatively to a 
surprising extent. The parameter sets that best reproduce the data correspond to situations 
where agents on average profit most (see supplement). This is plausible because real traders 
can choose among different markets. These findings suggest that the information annihila- 
tion instability is indeed one of the main causes of the notorious large jumps in price time 
series, particularly in strongly speculative markets. An empirical confirmation of our theory 
would require identifying the information states that cause large price changes in a given 
market, for which it then makes directly testable predictions. 
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I. SUPPLEMENTARY METHODS 



A. Model 

Each agent % = 1, . . . , N possesses two types of assets which without loss of generality are 
called money Mj(t) and stocks Si(t). N is the total number of agents. In each round, every 
agent places a market order to either buy or to sell an amount of stocks. Since trading 
should conserve the amount of traded assets, the price in each round is determined by: 



with demand 8 and supply of stocks q. This is a fair rule that could be used in a real market 
with only market orders. 

For a market including stochastic limit orders gathered over some period of time, consider 
the hypothetical price p*(t) at which trades would take place if all agents scaled their orders 
by a common factor. Then the volume would change, but to preserve market clearing the 
price is not affected; that is pit)* = pit). Therefore, the price is a function of the ratio of 
demand and supply. After linearization of this function for small small excess demands the 
price is proportional to the aforementioned ratio which justifies this choice of the pricing 
rule also as an approximation of the mean prices obtained from limit orders. 

Agents base their decisions on a public information state. In each time step one of D 
possible states, which is denoted by an index /i(t), is conveyed to the agents. We distinguish 
two different methods for the generation of these information states at each time step t: 

For exogenous information, /i(t) are independent identically distributed random variates 
drawn from a distribution P e xt (/•*)■ 

For endogenous information, agents possess a memory of the most recent K signs of the 
log returns. To eliminate the possibility of a lock, signs for vanishing returns are chosen at 
random. This information can take one out of D = 2 K possible states: 



where 9 denotes the Heaviside step function and 77 is an arbitrarily small symmetric random 
variable with zero mean. Simulation results do not depend on Var(r/) as long as it is small 
enough. 




mod D 



(2) 
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Each agent's i consequent decision is now determined by a strategy vector whose elements 
of are drawn randomly out of {0, 1} once and then kept constant. These two possible deci- 
sions correspond to trading an amount rrii{t) of money or an amount Sj(t) of stocks for the 
respective other asset in the next round. Orders are placed with a constant use parameter 7: 

Case of® = 1 (agent % buys stocks): 

m t (t) = 1 M i (t) (3) 
Si(t) = 

Case of® = (agent i sells stocks): 

mi(t) = 

Si(t) = <ySi(t) (4) 
Demand and supply are the sums of all buy and sell orders respectively 

N 

5(t) = ^m t (t) + e (5) 

i=\ 
N 

S(t) = 5>(t) + e (6) 

i=l 

where e is a small positive number. This ensures that prices and returns are always well 
defined. The cases with zero demand or supply are, however, irrelevant for all practical 
purposes. A sufficiently small e 1CT 3 does not influence simulation results to a meaningful 
degree. All figures were generated using e = 1CT 10 . 

We investigate the effect of a market ecology by dividing the agents into N s speculators 
and N p = N — N s producers. We focus on markets that are dominated by speculators whose 
resources are redistributed due to trading: 



M k (t + 1) = M k {t)-m k {t) + s k {t) P {t) 
S k (t + 1) = S k (t) - s k (t) + m k (t)/p(t) 



N p < k <= N. (7) 



Producers' resources stay constant throughout the game: 

> 3<N P . (8) 



M^t) = M,(0) 
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Thus, only speculators redistribute their wealth and are competitive whereas producers 
provide a predictable supply of liquidity and stocks. All agents are initially provided with 
equal amounts of resources Mj(0) = Si(0) = 1. 



B. Invariant Manifold 

We show, that if one distribution of resources (M, S) = (M 1; . . . , Mjv, Si, . . . , Sn) exists 
for which the price p[M, S,fi) = p is independent of the information \i, this price is invariant 
with respect to any resource redistribution due to trading in a purely speculative market. 
That is, there is a manifold Q = {(M , S ) | p(M , S , /i) = p V jj} of distributions of stocks 
for which the price is independent of fi and this manifold is closed with respect to trading 
according to Eq. (jTJ). For the proof, assume that at some point in time the system is in a 
suitable state such that 

^§4 = PVM^ (9) 

5(M,fi)-p<;(S,fi) = 

N 

tX) WM<-p(l V //(*)■ (10) 

1=1 

Then, denoting the distributions of stocks and money after trading by M i and S t we obtain: 

1 N N 

- (5(1',/) - ^(5',/)) = ^ ff f s; - - .f) s; (ii) 

7 j=l i=l 

N 

= J2 a i' (Mi-jatMi + jpil-a^Si) 

i=l 

-pD 1 " <') ^ - 7 (1 - $ + ~ < M t (12) 
i=l \ V J 

N N 

= E (<' ^ - ^ (1 - <') 3) -T^KI.-pfl- erf) 5.) (13) 

1=1 ' 8=1 

= 0-0 = (14) 

□ 
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FIG. S5: Average log-return variances for different values of a = D/N s . The model with exogenous 
information and only speculators serves as a reference. For comparison, simulations with a small 
numbers of either deterministic (N p = 2 ) or random producers (N n = 2 4 ) are shown as well 
as as the model with endogenous information. Grey area: Analytical upper and lower limit for 
exogenous information. Dark gray line: heuristic interpolation. 

C. Completeness of the Strategies 

As shown above, a sufficient condition for complete suppression of all price changes is 
finding a resource distribution (M, S) for which the price is independent of the information. 
That is, 



which is equivalent to Eq. ([TO]) . To fulfill this criterion, we need enough agents to form a 
complete basis in the strategy space which has D dimensions. Then, the deviation from p 
caused by each agent can be canceled by a superposition of the other agents for every /i. 
This can be guaranteed if the number of speculators N s exceeds 2D. 

For an insufficient number of speculators, we can still calculate an upper and a lower 
bound for the variance of the log returns given D and A" for a perfect superposition of 
speculators with exogenous information. Numerical and analytical results for this case are 
shown in Fig. IS5l The mean variance is found to drop dramatically at a = D/N s = 1/2, 
with an increasingly sharp transition for large D. This phase transition can be understood 
by considering the probability that a random binary vector can be canceled by an optimal 
superposition of A^ — 1 random binary vectors with positive weights. As an interim step, 



p{/i,M,S) = p V \i 



(15) 



v 



consider superpositions of random vectors with arbitrary weights. One such vector creates 
a one dimensional subspace. Adding a second vector expands the dimensionality of the 
subspace to d 2 — 2 if it is linearly independent of the first one. Adding further vectors one 
by one, the probability that the ith vector does not lie in a c^i-dimensional submanifold is 

P(d i = d i _i + l) = l-2 i? - di - 1 . (16) 

We can therefore iteratively calculate the probability distribution P(d,N a -i) of d after adding 
N s — 1 agents and the probability 

D 

P(d Ns = d Ns -, + l) = J2P(d = d Ns ^) (1 - 2 d - D ) (17) 

d=i 

that one out of N s agents is linearly independent of the others. If a vector is linearly inde- 
pendent of the other agents in d dimensions, it cannot be canceled by a linear combination 
of the other agents for all /i. However, it may still be possible to cancel this agent's impact 
for a subset of all possible /x, i.e. for a smaller subspace. Therefore, the probability that an 
agent cannot be canceled in any given time step is 

Pec. = = d*.-i) (1 - 2 d -°) (1 - ^)- (18) 

d=l 

The last term weights each summand with the fraction of dimensions in which the agent's 
impact is not canceled. Finally, to relate the fraction of not canceled agents to returns we 
need to consider the fluctuations prior to any resource redistribution. Since all strategies and 
jj, are chosen randomly, agents initially contribute to the demand or the supply at random. 
These fluctuations of demand and supply then follow a binomial distribution with N s trials 
and equal probability for buying or selling: 

5 oc B(N S , 1/2) (19) 
q<xB(N 8 ,l/2). (20) 



Since 



(5) = (?) = N s /2, and (21) 
Var(5) = Var(<;) = N s /4 (22) 



we can approximate the price for small deviations: 



N/2 + AS A5-As 

"=m^ xl+2 ^^- (23 > 
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Therefore, 



Var(p(0)) « 4/JV f 



(p(0)> = 1 



(24) 
(25) 



and finally 



Var(r(0)) 



(r(0)> = 0, 



iV s ln(10) 



8 



2 ' 



(26) 
(27) 



where the logarithm stems from using the logarithm with base 10 in the log return. 

Combining Eqns. (fT5j) and fT27|) . we obtain the expected variance of the return for an 
optimal superposition of agents without the positivity constraint on the resources 



Since resources cannot be negative, they form a positive cone. Each agent thatlocalized is 
linear independent of the others spans a half space. Therefore, 2N S agents are necessary 
to completely span the strategy space. Yet for small numbers of agents, each agent still 
represents a full degree of freedom since the probability that two agents lie on the same 1- 
dimensional submanifold is vanishingly small. However, as the number of agents is increased 
such that a — > 1, an increasingly large number of new agents just converts a halfspace into a 
full one. Therefore, Eq. (1251) represents a lower limit for the variance of the log returns which 
is a good description for N s ^ D. An upper limit is obtained by changing Eq. (1151) such, 
that each agents increases dby 1/2. This is a good approximation for N s « 2D. The area in 
between these limits is shown in Fig. [S5] (gray shaded). The lower limit has a phase transition 
at a = 1 while the upper limit has a phase transition at a = 1/2. A phase transition at a = 1 
is already present in Eq. (117]) . The gradual convergence for the true variance of the system 
from the lower to the upper limit is captured by a simple heuristic interpolation: For the 
dark gray line in Fig. [S5l the probability for a new linearly independent agent to increase d by 
one is Pi = min( 1, iV s /2 m+1 ) while the probability to increase d by 1/2 is P\/% = 1 — p\. The 
presented theory describes the numerical results (Fig. 1S5|) for the model with endogenous 
information very well for a < 1/2. For full markets, the residual error for simulations with 
only speculators is determined by the numerical precision. When producers are present, the 
residual error is noticeably higher. This is due to the fact, that producers push the system off 



Var(r n . c .) = Var(r(0)) P cx .. 



(28) 
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the invariant manifold. This error is dependent on the agents' use and vanishes for small 7. 
Still, predictable producers are canceled much better than random ones because speculators 
can successfully predict their choices. For endogenous information (D = 2 K ), the phase 
transition appears smother and slightly shifted towards larger a. A stronger reduction of 
average returns for a < 1/2 occurs due to the more localized adaptation: Agents do not 
adapt to all possible values of ji at the same time. 

D. Gradient Descent 

We now investigate, how the system evolves towards the invariant manifold. We focus 
on large numbers of agents and small 7. The resource redistribution due to subsequently 
trading the two assets for one another is found to be a special case of a learning rule which 
minimizes log-return magnitudes. Even more generally, we consider the error function 



with respect to the agents' resources is dominated by terms with the opposite sign as the 
change in the agents' resources. Therefore, any scaling of the agents' resources which keeps 
the sign of the return for money and the opposite sign for stocks corresponds to minimizing 
log return magnitudes similar to a gradient descent. 

To begin with, consider two subsequent time steps where the information takes the values 
jj, and //. We again consider a market consisting of speculators only. The derivative of the 
return with respect to the resources of an agent k is 



(29) 



and show that its gradient 



(30) 



<9r(M, S,/i,n') 
dMk 




f + 0(7), 
!-<>{ 



(31) 



dr(M,S, 
dS k 




+ 0(7), with 



(32) 





(33) 
(34) 
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The change in resources after trading twice is 

AM k = M'l — M fc = 7 (S k ({l - <)p + (1 - o{)j) - M k {4 + oft) + 0( 7 2 ) (35) 

AS fc = S>> - S k = 7 (M fc ^ + - ^(2 - < " j + 0(7 2 )- (36) 
We are interested in 

Ar '=( AM ^ +AS '^) (37) 

and continue only with leading terms in 7. 

For now, we also assume that agents can only perform roundtrip trades (RT). The general 
case will be discussed later. Two cases are left: 
Case<(l-<') = 1: 

Arf T 7 «i M k -p'S k M k /p-S k 



7 S q' 



(38) 



Case al (1 - a£) = 1: 



Arf T 7 <i pS k - M k + M k /p' - S k (4Q) 



7 5' 



6' \ ? 6 

Above, we used p = 5/q and p' = 8'/q'. Then, 

N 



+ 1} . (41) 



Iy>f = M + l-T (42) 




Therefore, the change in the total error function 

A / dr 2 dr 2 \ RT 

g ( AM 'M4 + AS ^) £ ° (44) 

can never be positive if agents only perform roundtrip trades. 

On average, this result holds even for the general case. The reason why we have to 
consider averages is, that agents who buy or sell two times in a row always decrease the 
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amount of money or stocks they own after two time steps. Therefore, these agents' resources 
are expected to change in the opposite direction of the gradient half of the time. That is, 
for every given pair of informations (//, //), a quarter of all agents' is expected to have their 
resources evolve such that future r(/j,,/j,') 2 increase. However, the actual influence of these 
agents is much lower and can be neglected for large systems. Then, demand and supply can 
be well described as binomial processes as shown above. Here, express demand and supply 
as: 



5= f + AS, 



(A8) 
(A?> 



0. 
0. 



(A, 2 ) < ? 



(45) 
(46) 



and the relative fluctuations around the mean demand N/2 are only yN/2 and therefore 
small for large N. Thus we can expand Eq. (1311) for small fluctuations: 



<9r(M, S,fx,(x') 

dM k 
dr(M, S, (i, fi') 

dSl. 



'2 


4A5' 


N 


N 2 


<) 


(-- 




\N 



- (J 



4AA 
IP) 



In 2 



2 

N 



(1-^)U7 



4A^' 



AT A^ 2 



(47) 
(48) 



As the above equation shows, when agents perform roundtrip trades, they contribute a term 
of order A^ -1 to the gradient with respect to each asset. When agents buy or sell twice, 
they only contribute a term of order A^ -L5 for one asset. Therefore, the influence of these 
agents vanishes for sufficiently large N. By a similar argument, approximately a quarter of 
all agents performs either one of the actions (buy, sell), (sell, buy), (buy, buy), and (sell, 
sell) while fluctuations can be neglected for large N. Summing up, the expected change in 
r(/x,/i') 2 over repeated trades with the same information 



- / 

E AM ' 



,fe=i 



dr 2 dr 2 
oM k 




< 0. 



(49) 



is always negative given a sufficiently large number of agents. 
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II. SUPPLEMENTARY FIGURES 



A. Phase Diagrams for Exogenous Information 

For uniformly distributed exogenous information states, a reduction of fluctuations does 
not generally increase the kurtosis (Fig. |S6|) . This is opposed to the endogenous case shown 
in Fig. [3] in the main paper. Only very large uses in overcomplete markets cause high 
kurtoses. Then, even uniformly distributed fi occasionally are not repeated for a sufficiently 
long time to be "forgotten" by the market. 



(a) Return Attenuation (b) Return Kurtosis 




0.01 0.1 1 0.01 0.1 1 

Use 7 Use 7 



FIG. S6: (a): Reduction of average return magnitudes during transients, and (b): kurtoses for 
the model with exogenous /i drawn from a uniform distribution. All other parameters and figure 
generation are identical to Fig. [3] in the main paper. 
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B. Speculators and Producers 

Fig. IS7l shows the phase diagram for a versus the amount of producers in the market. As it 
turns out, a second phase transition with respect to the number of producers is found. This 
transition is independent of the one for the speculators. Small N p < 0.5 • D only weakly 
influence return distributions. 



4 

@ 




102.4 21.3 4.6 1.0 0.2 102.4 21.3 4.6 1.0 0.2 



D/N p D/N p 

FIG. S7: (a): Reduction of average return magnitudes during transients, and (b): kurtosis of 
log-returns of the model with endogenous information for different numbers of speculators N s and 
producers N p for constant memory K = 10 (i.e. D = 2 10 ) and use 7 = 0.5. 
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C. Distribution of Information Ages (Surprise) 




FIG. S8: Complementary cumulative distribution of the times r(t) since the informations /j,(t) 
occurred last. Solid black line: Model with intrinsic information and D = 2 10 , N s = 2 11 , N p = 0, 
7 = 0.5. Short red line: Hill estimator. Dashed line: Exogenous information with P exo (fi) oc 
exp(— 0.02/i), leading to P(t) oc t -2 . 
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D. Mixed Information 

For a combination of endo- and exogenous information, results are similar to pure endoge- 
nous information as long as the endogenous part dominates. Generally, more exogenous 
information leads to a stronger reduction of fluctuations, less pronounced volatility cluster- 
ing, and random time series without visible patterns even for small 7. The scaling of the 
remaining extreme returns remains unchanged. An example is shown in Fig. [S9] 
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FIG. S9: Simulation for a model with 3 bits of uniform exogenous information and 6 bits of 
endogenous information. N s = 2 10 , N p = 0, 7 = 0.1. The first 2 7 time steps were discarded. Log 
returns r n are normalized by their standard deviation, (a): Time series (b): Solid black line: 
complementary cumulative distribution function. Short Red line: Hill estimator for the scaling 
exponent. Dashed grey line: normal distribution. 
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III. SUPPLEMENTARY DISCUSSION: INCOME AND THE CRITICAL POINT 



Fig. IS 101 shows the phase transition with respect to a in more detail. As in Fig. IS5j we 
take one parameter set as a reference to which we compare simulations after transients 
for different parameters. For orientation, the log-return variances (Fig. IS 101 (a)) and kur- 
toses(Fig. IS 101 (b)), which have been discussed earlier, are shown again. 
Mean speculator capitals 



are not constant over time in markets that include producers. For empty markets, the 
ratio of average speculator and producer capitals quickly evolves towards an equilibrium. 
The more agents are added, the longer it takes for C s to saturate. For critical or crowded 
markets a positive speculator income persists over long times. Then, average speculator 
capitals after transients can be well described as: 



The income factor a is shown in Fig. IS10I (c) and quantifies how well the speculators can 
exploit the producers, a is found to be independent of the initial ratio between speculator 
and producer capitals. It becomes maximal close to the critical point which can be intu- 
itively understood: For empty markets, there is a finite chance for a producer strategy to 
lie outside of the space spanned by the speculators. Therefore, increasing the number of 
speculators increases their average income. For crowded markets, producers are already op- 
timally exploited. Then, adding more speculators just distributes the maximal total income 
over more of them. An analogous maximum can be found in Minority Games (see e.g. ref. 
15 in the main paper). 

Fig. IS 101 (d) shows the gini index, a common measure of wealth inequality. Increased 
incomes coincide with increased capital inequality among speculators as the gini-index shows 
a maximum at the critical point. Thus, only few speculators are most successful in exploiting 
the producers. 
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FIG. S10: Properties of log-return and resource distributions depend on the parameter a = D/N s . 
Solid line: The model with unity (initial) speculator and producer capitals C s and C p and a small 
number of producers serves as a reference. Shown are averages from 50 simulations for each of 
which only the last T = 10 out of 2T timesteps have been analyted. (a): variances and (b): 
kurtoses were calculated from log-returns, (c): income factors a according to Eqns. (|50p and f)5 1 1) . 
(d) gini indices for speculator capitals after 2 T timesteps. For each other line or symbol, only one 
model parameters has been changed, respectively. 
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